Lesson: Configuring the Match Transform 


Match Criteria Options 


The majority of your data standardization should take place in the address cleansing and Data 
Cleanse transforms. However, there are a few preprocessing options specific to only the 
match process that can provide more accurate matching, which can be defined in the Match 
Editor. These options include removing punctuation, converting to upper case, converting 
diacritical characters, and converting text to numbers. 


Phonetic and Unicode Data Match 


There are instances where using phonetic data can produce more matches when used as a 
criterion, than if you were to match on other criteria such as name or firm data. For example, 
the names Smith and Smythe are only 72% similar when you match based on the name field, 
but are a 100% match when you match phonetically. 


Table 44: Matches for Smith and Smythe Based on the Name Field 


Name Comparison score 


Smith 72% 





Smythe 


Table 45: Matches for Smith and Smythe Based on Phonetic Data 


To match on phonetic data, use the Double Metaphone or Soundex functions to populate a 
field and use it for creating break groups or use it as a criterion in matching. 





If you intend to match on phonetic keys, set the criteria options as follows: 


Table 46: Phonetic Key Criteria Options 
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Set the match score options as follows: 





Table 47: Phonetic Key Match Score Options 
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